CDS

Accession Number TCMCG075C18167
gbkey CDS
Protein Id XP_017976648.1
Location complement(join(38220178..38220342,38220488..38220778,38220869..38221084,38221214..38221321,38221714..38221875,38222294..38222500,38222584..38222866,38222944..38223122,38223212..38223529,38223793..38223849,38223934..38224217,38224403..38224483,38224698..38225073,38225179..38225438,38226290..38226464,38226785..38226862,38227288..38227350,38227533..38227625,38227821..38227925))
Gene LOC18600446
GeneID 18600446
Organism Theobroma cacao

Protein

Length 1166aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018121159.1
Definition PREDICTED: protein ALWAYS EARLY 2 isoform X2 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category K
Description Protein ALWAYS EARLY
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
KEGG_ko ko:K21773        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04218        [VIEW IN KEGG]
map04218        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCGCCAACAAGAAAATCTAAAAGTGTGAACAAACGATATTCGAGTGTATATGAAGTCTCTCCTGATAAAGATGCCAGTAATTCAAGCAAAAATAAGCCAAAGAAATTAGCTGACAAGTTAGGATCTCAATGGAGCAAGGAAGAAATTGAGCGTTTTTATAAAGCTTATCGAGAGTATGGGAAAGATTGGAAGAAGGTGGCTGCTGCTGTGCATAATAGATCCACTGAAATGGTGGAGGCCCTTTACCTTATGAACCGGGCATATTTATCCCTGCCAGATGGAACAGCTTCTGTGATTGGCCTTATAGCAATGATGACCGATCATTATAGTGTCCTGAGAGGGAGCGATTGTGAAAGAGAGAGTAATGAGCCTTCTGAAATACCCCAGAAAGCTCAAAAGCGCAAGCGGGCAAAGGTTCATCTTGGTGCCTCAAAAGAAGGTGTTGTACAGCCTCAATCAATTGCATCTAGTCAGGGATGCCTATCTTTGTTGAAGAGGGCAGGCCTTAATGGTATTCATCCTCATGCAGTAAGGAAAAGAACTCCTCGGGTTCCTGTTTCATATTCATATAGGAGAAATGACACTGAAAGTTACATTCCACCAAACAAAAGAGTTAAGAAGTCAGACGCTGATGATAATGATGCTGAACATGTTGCCGCATTGACGTTGACTGGGGCATTGCAAAGGGGAGGCTCCCCTCAGGTTTCTCAGACACCTTACAAAAGAGCTGAATGCAGAAGATCCTCACCCGTTCAGAGCTATGATAGGACGTCACCACAACCGGAGACAACTAAGGCCAAGCTTGATGATTCTTCCTACGAATGCTGGATGGAAGGCAGGCCTAGGGGCACGGAACCTGTAATTGGAACTCATGCCAGAGATGCAGACCCCTTGATGGATATGGAAGTTGTTGGTACCATTGAAGGTCATCGAAAGGGAAAGAAATTTTACAGGAAGAAAATGAAAGTTGAAGAAACCAAGAACAATCTCTCTGATGATGGTGGAGAAGCGTGTAGTGGCACTGAAGAAAGAATTAGAGGTAGTACTCTCAAAGGGAAAGTTGATATGGAGATTACCAGTGCAAAAAGTGAACAACTTTCACCATGGAGTCAGAGGAAGAGAAGTAACAAGAAGCTTGTCTTCGGAGATGAAAGCTCTTCCATTGATGCTCTGCTGACATTAGCCAATTTGTCAACGTCAATGTTGCCAACATCAATAATGGAATCTGAATCATCTGTCAAATTGAAAGAGAATAGAATTACACTTGAATCTGTTGACAAGTCTAGTGCCCCTGAAGCTGCATCTACAAGTCATCACAGAGATAATATTAAGCACCTACGGCCAAACGAAAAGGTGCTCGACTCAATCACTGGTGCAGAGGAAGCTACCACTAGGAAACTAAAAGTTGGAAGGAATTCAGCTATGGATGATAATGTTGTTTCTGAGGCAAAACAAAAGCCAGAACCTACCAATAACTCTTGGAAAAGAAAACGCAAATCCTTCAGTTCAAAGATTTCTAATGCAGAAGCTTCAATGGATTCTCATCTCCGACAATCTTTTGATAATGAGGACATGGGTGAAGAAGACAATAAATATCTCACTAAAGGTAAATGTGGTGCTCAATCTTCTGTTCAATCAAGACAATGGAAGTCATTCAGAGTGTCAGAGGATTCCTCTACTAATGATGATCCAAAAATGGCTGGAATTGATTCAGTGGTGTTGACTTCACAAGTTCCTGCACCAAACCCTGTTAGCGTACCACCTAAGCATCAAAGTAGACGTAAAATGAACCTGAGGAGAGCCTTCCTTTCAACAGATAGAAGTTCTTCCAAGTGCACATTGAAAAATCAACCAATCAAGCAGTCTGTCACACAAGACAGACTAAAGGAACAGCTCTCTTCCTGCCTATCATCTAATCTGGCACGAAGATGGTGCTGTTTTGAATGGTTTTACAGTGCTATTGATTATGCTTGGTTTGCTAAAAGGGAGTTTGTTGAGTACCTAAATCATGTCGGACTGGGTCATGTTCCAAGGCTTACTCGTGTTGAGTGGGGTGTCATAAGAAGTTCCCTTGGCAAACCTCGGAGGTTTTCTGAACGCTTTTTACATGAAGAAAGGGAAAAACTTAAACATTATCGGGAGTCTGTGAGACAACATTATTCTCAGCTTCGCGTTGGTGCTAGGGAAGGACTTCCAACGGATCTGGCATATCCTTTATCAGTTGGACAACAAGTAATTGCCATTCATCCCAAAACGAGGGAAGTTCATGATGGAAAAGTACTTACTGTGGACCATGATAGGTGCAGGGTTCAGTTTGATAGTCCTGAACTAGGGGTTGAATTTGTCATGGATATTGATTGCATGCCATTAAATCCGTTGGAAAATATGCCGGAAGCACTTAGGAGACAGAACCTTGCTTTTGATAAATTCTCCGTGACACCTAAACCGTCTCAAGTGAATAGCCATTCAGATTTTGGTGGGTCCACGGTATTCACTTCAAGTGGGCGTCTGGAGAATGGAACCAGCCCTGTGAACATATCGGCAAATCAGATAAAGGTGGATGCCAACCGTAACATTTTGCATGCTAAGGCAGCTGTTCCTTATGTTGTTAGTGCACATCAAGCAGCCTATGGTCAACCACTTACCATGGCACATATCAAAGGGAGGGAAACTGATACACGAGCTATGTCTGAATTGAACGGTGCTCTTGACAAAAAGGAAGCTTTATTGATGGAGCTCAGAAACACGAACAATGACATATCAGAAAATCGAAATGGAGAAAGTTGTTTAAAAGATTCTGAACCTTTCAAGAAGCATATTGCCACGGCTTCTTCTGCTTTAGTTAACTTGAGACAACAAAATGCTTACCCAGCAAACCCCCTGTCACCTTGGCAGAAACCCCCAACCAATTCCAACTTCTTTGGTGGCTTGAAAAGTTATGTTGACAGTTCTCTTGTCTCACCAGAATCAGGATCTGGTGTGGGTGAAATTGTTCAAGGCTCAAGACTAAAGGCGCATGCTATGGTGGATGCTGCTATGAAGGCCATGTCATCAATGAAGGAAGGCGAAGATGCATTTATGAGGATTGGAGAAGCTTTGGACTCTTTAGATAAACGGCAATTCACATATGACATTAGGATGCCGGTGATCAAGTCACGAGAGCAGGAGAATGGCAGTATGGATTATCGCAATCACTTGGTTTCCTGTACATCAAAACCGGTGGCTGCCGGTTGGGCAACTAATCCCAAGTCGCAGGAGGCTTCTGACAAAAACGAGGAACAAGGTCCTTCAGAGCTGATCGCATCATGTGTTGCTACTTTGCTCATGATACAGACATGTACAGAGCGACAATATCCGCCAGCAGACGTGGCTCAAATAATCGATTCAGCTGTTACAAGCTTGCATCCATGTTTCCCCCAGAACCTGCCAATTTACCGAGAAATACAAATGTGCATGGGGAGGATTAAGACTCAAATATTAGCTTTGATACCCACTTGA
Protein:  
MAPTRKSKSVNKRYSSVYEVSPDKDASNSSKNKPKKLADKLGSQWSKEEIERFYKAYREYGKDWKKVAAAVHNRSTEMVEALYLMNRAYLSLPDGTASVIGLIAMMTDHYSVLRGSDCERESNEPSEIPQKAQKRKRAKVHLGASKEGVVQPQSIASSQGCLSLLKRAGLNGIHPHAVRKRTPRVPVSYSYRRNDTESYIPPNKRVKKSDADDNDAEHVAALTLTGALQRGGSPQVSQTPYKRAECRRSSPVQSYDRTSPQPETTKAKLDDSSYECWMEGRPRGTEPVIGTHARDADPLMDMEVVGTIEGHRKGKKFYRKKMKVEETKNNLSDDGGEACSGTEERIRGSTLKGKVDMEITSAKSEQLSPWSQRKRSNKKLVFGDESSSIDALLTLANLSTSMLPTSIMESESSVKLKENRITLESVDKSSAPEAASTSHHRDNIKHLRPNEKVLDSITGAEEATTRKLKVGRNSAMDDNVVSEAKQKPEPTNNSWKRKRKSFSSKISNAEASMDSHLRQSFDNEDMGEEDNKYLTKGKCGAQSSVQSRQWKSFRVSEDSSTNDDPKMAGIDSVVLTSQVPAPNPVSVPPKHQSRRKMNLRRAFLSTDRSSSKCTLKNQPIKQSVTQDRLKEQLSSCLSSNLARRWCCFEWFYSAIDYAWFAKREFVEYLNHVGLGHVPRLTRVEWGVIRSSLGKPRRFSERFLHEEREKLKHYRESVRQHYSQLRVGAREGLPTDLAYPLSVGQQVIAIHPKTREVHDGKVLTVDHDRCRVQFDSPELGVEFVMDIDCMPLNPLENMPEALRRQNLAFDKFSVTPKPSQVNSHSDFGGSTVFTSSGRLENGTSPVNISANQIKVDANRNILHAKAAVPYVVSAHQAAYGQPLTMAHIKGRETDTRAMSELNGALDKKEALLMELRNTNNDISENRNGESCLKDSEPFKKHIATASSALVNLRQQNAYPANPLSPWQKPPTNSNFFGGLKSYVDSSLVSPESGSGVGEIVQGSRLKAHAMVDAAMKAMSSMKEGEDAFMRIGEALDSLDKRQFTYDIRMPVIKSREQENGSMDYRNHLVSCTSKPVAAGWATNPKSQEASDKNEEQGPSELIASCVATLLMIQTCTERQYPPADVAQIIDSAVTSLHPCFPQNLPIYREIQMCMGRIKTQILALIPT